The E cacy of GlOSS for the Text Database Discovery Problem
نویسندگان
چکیده
The popularity of information retrieval has led users to a new problem: nding which text databases (out of thousands of candidate choices) are the most relevant to a user. Answering a given query with a list of relevant databases is the text database discovery problem. The rst part of this paper presents a practical method for attacking this problem based on estimating the result size of a query and a database. The method is termed GlOSS-Glossary of Servers Server. The second part of this paper evaluates GlOSS using four di erent semantics to answer a user's queries. Real users' queries were used in the experiments. We also describe several variations of GlOSS and compare their e cacy. In addition, we analyze the storage cost of our approach to the problem.
منابع مشابه
The E ectiveness of GlOSS for the Text Database Discovery Problem
The popularity of on line document databases has led to a new problem nding which text databases out of many candidate choices are the most relevant to a user Identifying the relevant databases for a given query is the text database discovery problem The rst part of this paper presents a practical solution based on estimating the result size of a query and a database The method is termed GlOSS ...
متن کاملPrecision and Recall of GlOSS Estimators for Database Discovery
On line information vendors o er access to multi ple databases In addition the advent of a variety of INTERNET tools has provided easy distributed access to many more databases The result is thou sands of text databases from which a user may choose for a given information need a user query This pa per an abridged version of presents a framework for and analyzes a solution to this problem which ...
متن کاملThe Effects of L1 and L2 Glossing on the Retention of L2 Vocabulary in Intentional and Incidental Settings
The current study investigated the effects of L1 and L2 glosses on L2 vocabulary retention in incidental and intentional settings. To this end, 100 intermediate Iranian female learners of English as a foreign language at Soroosh High School were given a pre-test to make sure that they do not have any prior knowledge of the target words. Reading passages with three different glossing conditions ...
متن کاملThe Effects of Glossing Conventions on L2 Vocabulary Recognition and Production
To investigate the effects of different glossing conventions on vocabulary recognition and recall, 158 participants were given a pre-test to make sure that they did not have any prior knowledge of the target words. Reading passages with four different glossing conventions (interlinear, marginal, pre-text, and post-text) were given to eight groups. Four groups received interlingual glosses and f...
متن کاملارائه مدلی برای استخراج اطلاعات از مستندات متنی، مبتنی بر متنکاوی در حوزه یادگیری الکترونیکی
As computer networks become the backbones of science and economy, enormous quantities documents become available. So, for extracting useful information from textual data, text mining techniques have been used. Text Mining has become an important research area that discoveries unknown information, facts or new hypotheses by automatically extracting information from different written documents. T...
متن کامل